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Abstract. Minkowski functionals provide a novel tool to characterize the large-scale 
galaxy distribution in the Universe. Here we give a brief tutorial on the basic features 
of these morphological measures and indicate their practical application for simulation 
data and galaxy redshift catalogues as examples. 
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1. Motivation 

Consider the set of points in three-dimensional space supplied by galaxy coordinates of 
a catalogue. Let us decorate each point with a ball of radius r. Our task is to measure 
the size, shape and connectivity of the spatial pattern formed by the union set of these 
balls. These characteristics change with the radius r, which may be employed as a 
diagnostic parameter as illustrated in Figure 1. 




Figure 1. 500 points extracted from a HDM simulation decorated with balls of 
different radius. 
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On the one hand, given an arbitrary starting point 

on the circle, any angle of direction within the shaded 

region, i.e. an angle from to 7r, is equally probable. 

A longer chord lies within the dark shaded region 

which encompasses an angle of 7r/3 altogether; so this 

reasoning leads to a probability of |. 
On the other hand, given an arbitrary direction, the 

chord takes any distance between zero and the circle's 

radius (shaded region) from the centre with the same 

probability; longer chords have distances from zero 

to half the radius (dark shaded region), therefore the 

probability is ^. 

Figure 2. Bertrand's paradoxon: Arbitrary chords inside a circle may be longer or 
shorter than the side of the inscribed triangle — but what is the probability for them 
to be longer? 

In 1889, Bertrand attempted to solve the problem in geometric probability theory 
shown in Figure 2 and came up with two different results. Poincare suggested 
to overcome such ambiguities by tying probability measures to geometric groups. 
Minkowski [1] took a first systematic approach setting off the evolution of the field 
of mathematics now known as integral geometry. 

2. Fundamentals 

2.1. Hadwiger's theorem 

A possible starting point for integral geometry [2] is some manifold with a group of 
transformations Q. Usually we are dealing with J-dimensional Euclidean space for which 
the natural choice is the group that contains as subgroups rotations and translations. 
One can then consider the set K, of convex bodies embedded in this space and, as an 
extension, the so-called convex ring 1Z of all finite unions of convex bodies. In order 
to characterize a body B from the convex ring one looks for scalar functionals M that 
satisfy the following requirements: 

Motion Invariance The functional should be independent of the body's position and 
orientation in space, 

M(gB) = M(B) for any g G Q, B G K. 

Additivity Uniting two bodies adds their functionals, minus the functional of the 
intersection, 

M(B 1 U B 2 ) = M{B 1 ) + M(B 2 ) - M(B 1 n B 2 ) for any B 1 ,B 2 G K. 
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Conditional Continuity The functionals of convex approximations to a convex body 
converge to the functionals of the bodyf , 

M(Ki) -> M(K) as l\ -> K for l\. I\ G k\ 

One might think that these fairly general requirements leave a vast choice of such 
functionals. Surprisingly, Hadwiger's theorem states that in fact there are only d + 1 
independent such functionals if space is (/-dimensional. In this sense the Minkowski 
functionals are unique and complete. To be more precise: 

Hadwiger's theorem [3] Let 1Z be the convex ring embedded in (/-dimensional space. 
Then there exist d + 1 functionals M^, fi = . . . d on 1Z such that any functional M on 
1Z that is motion invariant, additive and conditionally continuous can be expressed as 
a linear combination of them: 

d 

M = ^ c^Mfj,, with numbers c^. 

Throughout the literature, hardly anybody uses these quantities themselves, but 
prefers to attach some constant factors. The four most common notations are M^, V^, 
V fj, and Wfj,, defined as follows (co^ is the volume of the //-dimensional unit ball): 

^d Vv 

W := ^ M With u; .= _^i_ 

* oj d -, ^ » r(i + j/2) 

In three-dimensional Euclidean space, these functionals have a direct geometric 



Table 1. The most common notations for Minkowski functionals in three-dimensional 
space expressed in terms of the corresponding geometric quantities. 
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interpretation as listed in Table 1. Apart from figures, where we will switch to the V^- 
notation in order to get the easily interpretable Euler characteristic without a constant 
factor, we will stick to the M^-notation throughout this paper since it offers the most 
convenient representation of some central formulae. 

| This applies to convex bodies only, not to the whole convex ring. 
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Now for the geometric interpretation: The first functional equals the volume V of 
the body, the second one is the surface area A. The third functional corresponds to the 
integral mean curvature H of the body's surface and provides information about shape. 
Lastly, the fourth functional can be interpreted as the Euler characteristic x which is 
a purely topological quantity. It is related to the genus g via x = 1 ~~ 9 an d can be 
calculated using the simple formula 



2.2. The principal kinematical formula of integral geometry 

One of the central issues of integral geometry is the calculation of mean values over 
motions of bodies. In order to do this one needs some measure dg to perform integration 
on the group of motions Q. Demanding invariance of the measure under motions seems 
a sensible requirement and indeed gives a measure that is well-defined and even unique 
up to a multiplicative constant for most groups of interest. In order to visualize this so- 
called Haar measure it is best to stick to a certain parametrization of the group under 
consideration. Luckily, most applications involve calculations that can be performed 
within the framework of integral geometry and do not require an explicit representation 
of the group of motions. 

Taking two bodies A and B from the convex ring we will fix the position and 
orientation of A and allow B to move around, i.e. apply any transformation g from the 
group of motions Q. The resulting intersection A D gB is again a member of the convex 
ring and thus has Minkowski functionals Af |U (A H gB). Finally, given a Haar measure 
dg on (?, we can take the mean value over the group of motions and think of ways to 
calculate it from the functionals of A and B alone. The result is the following 
Principal kinematical formula [3] 



2.3. Minkowski functionals of a Poissonian distribution 

The pincipal kinematical formula also offers an elegant way of obtaining explicitly the 
Minkowski functionals of Poisson distributed bodies [4, 5]. The Haar measure gives 
equal weight to any position and orientation of a body in space, so an integration over 
the group of motions corresponds to taking the mean value over the Poisson distribution 
of one body K. Let us distribute N identical, yet independently distributed bodies A^-, 
each with Minkowski functionals M . . . M 3 in the region Vf. Any configuration Bn can 

| V has to be finite in order to ensure uniqueness of the Haar measure. 



X = number of components — number of tunnels + number of cavities. 
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Figure 3. Analytical and numerical Minkowski functional for a Poisson process 
plotted against the radius of balls measured in unit lengths: The shaded regions show 
the variance of a sample of 50 realizations of the Poisson distribution. Note that the 
variance is already fairly small for as few as 50 balls per unit volume. To the right, a 
unit box containing a typical configuration is shown at various radii displaying different 
topologies as explained in the text. 



be obtained by applying a motion to each of the individual bodies, and the mean values 
can be calculated by multiple integration with the product measure d/ijy: 

N N dgi f 

Bn ■= U <W := TT TTT7 with / d/ijy = 1- 

i i V Jq n 

Using this, one can explicitly calculate the mean values of the Minkowski functionals 
per volume [4, 5]. Enlarging number and volume to infinity such that the density 
p = iV/|V| remains constant one arrives at the limits: 

m = l-e~ pM \ m 2 = e~ pM ° (pM 2 - p 2 M[), 

m x = e~ pMo pM 1 , m 3 = e~ pM °(pM 3 — 3p 2 MiM 2 + p 3 Mf). 

The values are plotted, together with numerical results, in Figure 3 for a density of 50 
balls per unit volume. One can see that, as the radius increases, the volume (functional 
Vo ) is filled until reaching complete coverage at density f , while the topology undergoes 
a number of changes which can be seen in the Euler characteristic (functional V3) as well 
as in the configurations shown on the right. Very tiny balls (top left, radius .05) remain 
isolated from each other and so the Euler characteristic x lS close to the number of balls 
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Figure 4. Minkowski functionals for idealized structures are highly efficient 
discriminators. To the left, Minkowski functionals per point and unit volume are 
shown for a Poisson distribution on a line (dashed), on a plane (dotted) and inside the 
whole space (solid). In all distributions the points were seperated by the same mean 
distance of .25 unit lengths; in the example configurations to the right the points have 
a mean seperation of .05 unit lengths and are decorated with balls of the same radius. 



for the lowest radii. As the radius increases, balls join (top right, radius .1) and hence 
X decreases, even below zero as more and more tunnels are formed in the configuration 
(bottom left, radius .f5). This behaviour reaches a turning point when these tunnels 
are blocked to form closed cavities (bottom right, radius .25) which give again a positive 
contribution to the Euler characteristic. Finally, these cavities are closed one by one 
and so we end up with all space filled, and \ equal to zero. 

2Jf. Clusters, walls and filaments 

The principal kinematical formula also allows some analytical calculations for point 
processes beyond the simple Poissonian case. Since Minkowski functionals have already 
yielded some promising preliminary results [6] concerning discrimination of filamentary 
and wall-like structures in the Universe, we have calculated analytical values for the 
Minkowski functionals of idealized filaments and walls, i.e. points on an infinitely 
extended straight line distributed according to a Poisson distribution (on a flat plane, 
respectively). The results for the three distributions are shown in Figure 4 for equal 
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mean distance of the points. 
3. Applications 

Examples of the discriminatory power of the Minkowski functionals are shown in Mecke, 
Buchert and Wagner [7], Buchert [8], and Platzoder and Buchert [6]. In the following 
we discuss the application of Minkowski functionals to simulation data and to redshift 
catalogues. 

3.1. Simulations 

As an example for the analysis of simulation data we calculated the Minkowski 
functionals for a HDM simulation (using first-order Lagrangian perturbation theory), 
with h = 0.5, = 1, and primordial spectral index n = +1 in a cubic box with side 
length 200 Mpc using 64 3 particles. We extracted randomly 5 different subsamples with 
5000 points. In Figure 5 we show the mean and the standard error of the Minkowski 
functionals calculated in a periodic box. Although using only five subsamples the errors 
are very small. In this sense the Minkowski functionals are robust. Using this approach 



Vo Vi V a Vo 
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Figure 5. Minkowski functionals of a HDM simulation. 

we are able to compare simulations. A comparison with data derived from catalogues is 
only possible with a suitable galaxy or cluster extraction scheme and a treatment of the 
boundaries (see below). Since Minkowski functionals depend on the number density, 
comparisons are only possible for samples of equal number density. Currently only 
scaling relations for the Poisson process are known to us. 

3.2. Redshift catalogues 

Integral geometry offers a concise way of dealing with boundaries [4]. Let D be the 
window (the sample geometry) through which we look at N galaxies. B r = U^Lo B r (i) is 
the union of all balls B r (i) of radius r centered on the z-th galaxy respectively. One has 
to calculate the Minkowski functionals of the intersection of the union of all balls with 
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radius [km/a] radius [krn/s] radius [km/s] radius [kni/s] 

Figure 6. To the left the densities of the Minkowski functionals V M (5 r fl D)/\D\ 
within the sample window D are depicted, to the right the densities of the Minkowski 
functionals (jE?,- )/ 1 V | of the Universe. As expected, the volume density is the same 
for both sets, while the other functionals clearly exhibit the removal of boundary 
effects. The errors are bootstrap errors, we extracted ten samples with 143 galaxies 
from the 153 galaxies in the cube. Again the errors are small, we nearly gain nothing 
in extracting fifty bootstrap samples instead of ten. 



the window, Af |U (i? r fl D) and the Minkowski functionals of the window itself Af |U (D). 
Therefore, we not only have to compute the intersection of two or three balls [7], but 
we also have to compute the intersections of a ball with the boundary of D. 

The quantities Af |U (i? r fl D) are well suited for the analysis of redshift catalogues 
since we do not assume anything on the point distribution and still analyze all galaxies 
in the sample. Using the same window D one is able to compare different catalogues 
and investigate their homogeneity and isotropy. It is not necessary to impose artificial 
assumptions on the underlying point process like using periodic boundary conditions (in 
the case of a cubic sample) or any other way of embedding (e.g. Poisson). 

Although we argued in favour of not imposing assumptions on the underlying point 
process we want to show another way of extracting information from the catalogue, 
assuming homogeneity and isotropy. Then we get for the average under movements of 
the window 

{M,(B r nD))=i / M,(B r n gD)dg. 
\V\ Jg 

V again is a region significantly larger than \D\ (e.g., the whole Universe). If we 
additionally assume that within D we have a fair sample of the Universe, the averages 
greatly simplify: 

(Mfj,(B r fl D)) = Mfj,(B r fl D). 
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Using the principal kinematical formula we then conclude 

M,(B r n D) = ^ p Q M„(B r )M^(D). 

With these strong assumptions and using the computed values of Af |U (i? r H D) and 
Af |U (D), we are able to extract the densities of the Minkowski functionals Af |U (i? r )/|V|, 
thus removing the contributions of the boundary. We getf 

|V| M (D) ^ \u) \V\ M (D) ' 

These expressions are more suitable for computational purposes than the explicit 
solution of the system. 

To illustrate this procedure we calculate the density of the Minkowski functionals 
in the Universe based on the intersection of the Universe with a cube selected from the 
CfA catalog as discussed by Gott et al. [9], with side length s = 5000/v / 3^~ 1 kms~ 1 . 
We put a cube, our window D } with sidelength s = s /2 into the center of the sample 
cube. This is necessary since we calculate the Minkowski functionals for balls up to a 
radius s /6. For the window we get 

V (D) = \D\ = s 3 , V 1 (D)=s\ V 2 (D)=s, V 3 (D) = 1. 

In Figure 6 we show the densities of the Minkowski functionals V fl (B r (~}D)/\D\ including 
the contributions of the intersections with the window D } and the reconstructed densities 
of the Minkowski functionals V^(-B,.)/|V| of the Universe i.e., the Minkowski functionals 
after removing the contributions of the window D. Again we want to emphasize that no 
assumptions enter in calculating V^Bj. D D)/\D\, but the reconstruction of V^(-B,.)/|V| 
is relying on homogeneity and isotropy. 

This is meant as an example; this cube is certainly not a fair sample. 
4. The software and its future 

The ANSI C code for computing the Minkowski functionals of balls in a 
box with periodic boundary conditions may be obtained via e-mail from 
buchert@stat.physik.uni-muenchen.de. The algorithm follows the description in 
Mecke, Buchert, and Wagner [7] and calculates the measures. Memory requirements 
are moderate (less than 5 MByte for samples of up to 10,000 points), while CPU time 
strongly depends on the number of points and on the clustering properties of the point 
distribution. A typical example is a Poisson distribution of 1,000 points, which took ap- 
proximately 40 minutes on a HP 715/80 workstation. A parallelized version using pvm3 

| We use the convention ^i=o a i = 0- 
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and a version dealing with boundaries in the described way are in the test stage. Addi- 
tionally we are working on a program for structure discrimination using the analytical 
formulae for filaments, walls, and clusters. 
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